-
Notifications
You must be signed in to change notification settings - Fork 81
Use AWS Neuron SDK 2.26 #977
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
8dd321a
to
8bbe1f4
Compare
8bbe1f4
to
83956bd
Compare
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update. |
4d3ca92
to
111d3ab
Compare
Note that the NEURON platform is now deprecated.
Since fixtures are always included, avoid import errors when specific packages required by a fixture are not available.
9a0fd7c
to
f9e78c5
Compare
uses: ./.github/actions/prepare_venv | ||
- name: Install optimum-neuron | ||
uses: ./.github/actions/install_optimum_neuron | ||
- name: Install datasets dependencies |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Perhaps rephrase it to "Install audio tests dependencies"
4e20238
to
8a9ac50
Compare
This avoids importing docker and openai for all tests
Note that github variables used for inputs can only be of type string. This is why the 'use_cuda' variable is not a boolean. Being able to configure the pytorch installation allows a specific workflow to install a specific torch version, or to use CUDA (some packages are not compatible with pytorch CPU version).
neuronx-distributed is always required.
a34b5bb
to
1238def
Compare
c098698
to
aca8f74
Compare
Some tests are failing with compel>=2.2.0
These tests hang with AWS Neuron SDK 2.26
aca8f74
to
930bf85
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
What does this PR do?
This bumps the AWS Neuron SDK version to 2.26.
This also bumps the torch version to 2.8, which in turns leads to vLLM to be updated to 0.10.2 (the first version supporting pytorch 2.8).
There are some remaining errors in:
FAILED tests/training/test_custom_modeling.py::test_custom_model_tie_weights - Failed: Test failed with SafetensorError: Error while deserializing header: incomplete metadata, file not fully covered
Flux test hangs